A Speech Understanding Framework that Uses Multiple Language Models and Multiple Understanding Models

نویسندگان

  • Masaki Katsumaru
  • Mikio Nakano
  • Kazunori Komatani
  • Kotaro Funakoshi
  • Tetsuya Ogata
  • Hiroshi G. Okuno
چکیده

The optimal combination of language model (LM) and language understanding model (LUM) varies depending on available training data and utterances to be handled. Usually, a lot of effort and time are needed to find the optimal combination. Instead, we have designed and developed a new framework that uses multiple LMs and LUMs to improve speech understanding accuracy under various situations. As one implementation of the framework, we have developed a method for selecting the most appropriate speech understanding result from several candidates. We use two LMs and three LUMs, and thus obtain six combinations of them. We empirically show that our method improves speech understanding accuracy. The performance of the oracle selection suggests further potential improvements in our system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving speech understanding accuracy with limited training data using multiple language models and multiple understanding models

We aim to improve a speech understanding module with a small amount of training data. A speech understanding module uses a language model (LM) and a language understanding model (LUM). A lot of training data are needed to improve the models. Such data collection is, however, difficult in an actual process of development. We therefore design and develop a new framework that uses multiple LMs and...

متن کامل

Online adaptation of language models in spoken dialogue systems

The robust estimation of language models for new applications of spoken dialogue systems often suffers from a shortcoming of training material. An alternative to training a language model is to improve an initial language model using material obtained while running the new system, thus adapting it to the new task. In this paper we investigate different methods for onlineadaptation of language m...

متن کامل

Online Adaptation for Language Models in Spoken Dialogue Systems

The robust estimation of language models for new applications of spoken dialogue systems often suffers from a shortcoming of training material. An alternative to training a language model is to improve an initial language model using material obtained while running the new system, thus adapting it to the new task. In this paper we investigate different methods for onlineadaptation of language m...

متن کامل

Three Approaches to Understanding and Classifying Mental Disorder: ICD-11, DSM-5, and the National Institute of Mental Health’s Research Domain Criteria (RDoC)

The classification of mental disorders has long been the subject of controversy among mental health professionals. Despite a Significant expansion of knowledge about mental disorders during the past half century, understanding of their processes and components remains rudimentary. This article provides descriptions of three systems with different purposes relevant to understanding and classifyi...

متن کامل

Porting the Galaxy System to Mandarin Chinese

Galaxy is a human-computer conversational system that provides a spoken language interface for accessing on-line information. It was initially implemented for English in travel-related domains, including air travel, local city navigation, and weather. Efforts were started to develop multilingual systems within the framework of galaxy several years ago. This thesis focuses on developing the Mand...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009